Adding Emotions to Malay Synthesized Speech Using Diphone-based Templates

نویسندگان

  • Syaheerah L. Lutfi
  • Raja Noor Ainon
  • Salimah Mokhtar
  • Zuraidah M. Don
چکیده

This paper concerns the addition of an affective component to Fasih, one of the first Malay Textto-Speech systems developed by MIMOS Berhad. The goal is to introduce a new method of incorporating emotions to Fasih by building an emotions filter that is template-driven. The templates are diphone-based emotional templates that can portray four types of emotions, i.e. anger, sadness, happiness and fear. A preliminary experiment that focused on showed that the recognition rate of Malay synthesized speech is over 60% for anger and sadness.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Template-driven Emotions Generation in Malay Text-to-Speech: A Preliminary Experiment

This paper describes the pilot experiment conducted for the purpose of adding an affective component to the first Malay Text-to-Speech (TTS) system, Fasih. The aim is to test a new method of generating an expressive speech via a template-driven system based on diphones as the basic sound. The synthesized expressive speech can express four types of emotion. However, as an initial test the pilot ...

متن کامل

Integrating rule and template-based approaches for emotional Malay speech synthesis

The manipulation of prosody, including pitch, duration and intensity, is one of the leading approaches in synthesizing emotion. This paper reports work on the development of a Malay Emotional synthesizer capable of expressing four basic emotions, namely happiness, anger, sadness and fear for any form of text input with various intonation patterns using the prosody manipulation principle. The sy...

متن کامل

Complex Emotions - the Simultaneous Simulation of Emotion-related States in Synthesized Speech

We describe an approach to simulate first and secondary emotional expression in synthesized speech simultaneously by targeting different parameter categories. The approach is based on the open-source system “Emofilt” which utilizes the diphone-synthesizer “Mbrola”. The evaluation of the approach by a perception experiment showed that the pure emotions were all recognized above chance. Whereas t...

متن کامل

Prosodic Analysis and Modelling for Malay Emotional Speech Synthesis

This paper discusses an emotional prosody generator for a Malay speech synthesis system that can re-synthesize the selected vocal emotion from neutral synthesized speech output and improve the naturalness by adopting rulebased prosody conversion techniques. The role of prosodic features in emotional expression, particularly fundamental frequency and duration, has been widely investigated in sev...

متن کامل

مراحل و نحوه ی تهیه ی دادگان های صوتی هجایی و دایفونی برای سامانه ی تبدیل متن به گفتار فارسی

Abstract Speech databases are part of the concatenative text to speech synthesis systems. Phonetic quality of the databases plays a significant role in the naturalness of the synthesized speech. This paper introduces two syllable and diphone speech databases for Persian and investigates the way of their development and their specifications and their advantages to each other. ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2005